ON THE EXPECTED NUMBER OF ZEROS OF 
NONLINEAR EQUATIONS 



GREGORIO MALAJOVICH 



Abstract. This paper investigates the expected number of roots 
of nonhnear equations. Those equations are assumed to be ana- 
lytic, and to belong to certain inner product spaces. Those spaces 
are then endowed with the Gaussian probability distribution. 

The root count on a given domain is proved to be 'additive' 
with respect to a product operation of functional spaces. This 
allows to deduce a general theorem relating the expected number 
of roots for unmixed and mixed systems. Examples of root counts 
for equations that are not polynomials nor exponential sums are 
given at the end. 

1. Introduction 

We consider systems of analytic equations of the form 

/i(x) = --- = /„(x) = 

where x is assumed to belong to a complex n- dimensional manifold M. 
Each fi belongs to a certain complex inner product space J'j. Those 
will be called spaces of complex fewnomials, or fewspaces for short (see 
definition E]). 

Let nM(f) denote the number of isolated roots of the system above. 
More generally, let nx{i) be the number of isolated roots in a set %. 
A consequence of Brouwer's degree theorem is that when % is open, 
the number nx{i) is lower semi-continuous as a function of f (details 
in [221 Ch.3]). 

When the 5", are spaces of polynomials (resp. Laurent polynomials) 
and M = C"" (resp. M = (C \ {0})""), the number njv-/(f) is known to 
be be equal to its maximum generically, that is for all f except in a 
codimension 1 (hence measure zero) variety. Bounds for this maximum 
are known, and some of them are exact. 
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For instance, let 3^a be the set of Laurent polynomials with support 
A, viz. 

f(v\ — \ f ^'^l^'^2 . . . t'^" 
aeA 

where A C Z" is assumed to be finite and fa € C. The inner product 
in 3^A is arbitrary. Let A denote the convex hull of A. 

Theorem 1 (Kushnirenko [19J). Let /i, ■ ■ ■ , /„ G 3^a- For a generic 
choice of coefficients fia G C, 

n(c\{o})"(f) = n\ Vol(^). 

The case n = 1 was known to Newton, and n = 2 was published by 
Minding [3l] in 1841. A system as above, where all the equations have 
same support A is said to be unmixed. Otherwise, the system is said 
to be mixed. The following root count for mixed polynomial systems 
was published by Bernstein [4J and is known as the BKK bound (for 
Bernstein, Kushnirenko and Khovanskii) [5j: 

Theorem 2 (Bernstein). Let Ai, . . . , A„ C Z" be finite sets. Let Ai be 
the convex hull of A^. For a generic choice of coefficients fia G C, 

'^(cuo})" (f ) 

is n\ times the coefficient V o/ Ai . . . A„ in the polynomial 

\Yo\{\iAi + --- + XrAn)- 

This number V is known as the mixed volume of the tuple of convex 
bodies {Ai, . . . , A„). 

The objective of this paper is to extend the results above to more 
general spaces of analytic equations. For instance, we would like to 
count zeros of equations such as 

/OO + fQlX + fo2X^ + ■■■ + fodX'^+ 

+/ioe"' + foixe^ + fo2x'e^ + ■■■ + fodX^e^ = 0. 

It is easy to see that the number of solutions in C for (say) d = is 
infinite. However, we can inquire about the number of solutions in a 
smaller set, like the disk D = {a; G C : |a;| < 1}. 

Instead of counting the generic number of zeros (that exists no more), 
we endow the space of equations with a probability measure (zero av- 
erage, unit variance normal distribution) and compute the expected 
number of isolated roots. 

In the example above, the expected root count is 

E(ni,(/)) = d/2 + 0.202, 918, 921, 282 ■• • 
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(see Section m for the precise inner product we are using). The constant 
0.202 ■ ■ • was obtained numerically. I would like to thank Steven Finch 
for pointing out an error in the 4-th decimal of a previous computation, 
and giving the correct decimal expansion. 

This and other examples are worked out in Section [6] 

It turns out that complex fewnomial spaces are reproducing kernel 
spaces. A meaningful multiplication operation between reproducing 
kernel spaces was studied by Aronszajn ^ (see Section H]). We denote 
the product space of 5" and S by J'S, and the A-th power of 5" by 3^^. 
The main result in this paper is an analogous to Bernstein's theorem. 
However, there is no more an interpretation of the number of roots in 
terms of a volume of a convex body (Minding and Kushnirenko) or in 
terms of mixed volume. But the relation between root counts in mixed 
and unmixed systems is preserved. 

Theorem 3. Let 3^1, . . . , be finite dimensional fewspaces of func- 
tions of M, endowed with the zero average unit variance normal prob- 
ability distribution. Let % (1 M be measurable. Then, 

K(f)) 

is the coefficient o/ A1A2 ■ ■ • A„ in the n-th degree homogeneous polyno- 
mial 

g-^Ai A2, A„(n3^(g)) 

where zero average and unit variance normal probability distribution is 
assumed in each 3^^^ 3^2^ ■ ■ ■ 3^^". 

In the setting of Bernstein's theorem, one may identify S^AiAi+.-.+A^An 
to 3^Ai'^A2 ' ' ' -^yC- With this identification, Bernstein's theorem follows 
immediately from Kushnirenko's theorem and Theorem [3l 

The basic idea for the proof of Theorem [3] is: 

Lemma 4. Let £, 3^1, 3^2, • • • , 3^n be finite dimensional fewspaces and 
let X > be integer. Then, 

%e£^:fi,/2GJ2,...,/ne:f„("'K(f)) = %e£,/2e:F2,...,/ne5„(^^K(f)) + 

+'^%eJi,/2e:f2,...,/neJ„('^3c(f)). 

Above, all fewspaces are assumed with the zero average, unit variance 
normal probability distribution. 
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2. Related work 



Random polynomial systems constitute a classical subject of studies, 
and received a lot of attention lately (See for instance the book by 
Azai's and Wschebor |3J and references). Part of the interest comes 
from the study of algorithms for solving polynomial systems such as 
in [261430] . The running time of algorithms can be estimated in terms 
of certain invariants, such as the number of real or complex zeros, 
and the condition number. While the number of real zeros of real 
polynomial systems and the condition number depend on the input 
system, it is possible to obtain probabilistic complexity estimates by 
endowing the space of polynomials with a probability distribution, and 
then treating those quantities as random variables. For the full picture, 
see the book [6] and two forthcoming books [8l[22]. Recent papers on 
the subject include p!| [T0] - [T2] . The extension of this theory to systems 
of sparse polynomial systems started with [23l[2lj (see below) and is 
still a research subject (see |22]). 

Another source of interest comes from classical asymptotic estimates 
such as in Littlewood-Offord |20l|21] and Kac mut]. 

Asymptotic formulas for the number of roots of sparse polynomial 
systems can be obtained by scaling the supports. For instance, one 
looks at systems of Laurent polynomials such as 



where t is a scaling parameter. A random variable of interest in the 
zero- dimensional case is t~"nM(f). In [25], Shiffman and Zelditch gave 
asymptotic formulas for the root density in terms of the mixed volume 
form. 

Kazarnovskii [17j obtained more general formulas. He considered 
fewnomials that are (after multiplying variables by v^—T) Fourier trans- 
forms of distributions supported by real compact sets. For instance, ([T]) 
is the Fourier transform of a distribution with support {0, 1}, namely 



The convex bodies that appear in the Kushnirenko and Bernstein 
theorems are replaced by the convex hull of the support of the dis- 
tributions. In this sense, he generalized Bernstein's theorem to non- 
polynomials and non-exponential-sums. However, his bounds for (say) 



ta 



i=0,l 
j=0,...,d 
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([I]) do not take into account different values of d. That is why those 
bounds must be asymptotic. 

3. Spaces of complex fewnomials 

Let M be an n-dimensional complex manifold. In this section we 
review part of the theory of spaces of complex fewnomials in M. This 
theory is developed in more details in |22j. Canonical references for 
analytic functions of several variables and for reproducing kernel spaces 
are, respectively, [TS] and [2]. 

Definition 5. A complex fewnomial space (or fewspace for short) of 
functions over a complex manifold M is a Hilbert space of holomorphic 
functions from M to C such that the following holds. Let V : M ^ 3^* 
denote the evaluation form l^(x) : / i— )■ /(x). For any x e M, 

(1) V{x.) is a continuous linear form. 

(2) V{x.) is not the zero form. 

In addition, we say that the fewspace is non- degenerate if and only if, 
for any x G M, 

3- Pv{x.)DV{'x) has full rank, 

where Pw denotes the orthogonal projection onto W~^. (The derivative 
is with respect to x). In particular, a non-degenerate fewspace has 
dimension > n + 1. 

Example 6. Let M be an open connected subset of C". Bergman 
space A{M) is the space of holomorphic functions defined in M with 
finite C"^ norm. The inner product is the £^ inner product. When M 
is bounded, A{M) contains constant and linear functions, hence it is a 
non-degenerate fewspace. 

Remark 7. Condition 1 holds trivially for any finite dimensional fewno- 
mial space, and less trivially for subspaces of Bergman space. 

To each fewspace 5" we associate two objects: The reproducing kernel 
if (x, y) = Kj-^-x, y) and a possibly degenerate Kahler form uj = ojj- on 
M. 

Item (II]) in the definition makes ^(x) an element of the dual space 
3^* of 5" (more precisely, the space of continuous functionals 5" — )■ C) . 

Riesz-Frechet representation Theorem (e.g. [7] Th.V.5 p. 81) allows 
to identify 5" and 3"*, whence the Kernel ii'(x, y) = (V(x)*)(y). 

For fixed x, K{'x, y) G as a function of y. 

By construction, for / G 

/(y) = (/(-),^(-,y))- 
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There are two consequences. First of all, 

ir(y, x) = x), K{-, y)) = y), x)) = 

and in particular, for any fixed y, x h-;- -ft'(x, y) is also an element of 
5". Thus, -ft'(x, y) is analytic in x and in y. Moreover, ||i^(x, ■)||^ = 
K(x,x). 

Secondly, -D/(y)y = (/(■)) DyK{- ,y)y) and the same holds for higher 
derivatives. 

Because of Definition [5]([2]), K{-,y) ^ 0. Thus, y ^ K{-,y) induces a 
map from M to P(3'). The differential form u is defined as the pull-back 
of the Fubini-Study form 

'^/ = ^5^iogii/ir 

ofP(3^) hjy^K{;y). 
Namely, 

(2) (Wg-)^ = Wx = -^^^^^ log ^(^' ^) • 

When the form u is non-degenerate for all x G M, it induces a 
Hermitian structure on M. This happens if and only if the fewspace is 
a non-degenerate fewspace. 

Remark 8. If 3" = yi(M) is the Bergman space, the kernel obtained 
above is known as the Bergman Kernel and the metric induced by u 
as the Bergman metric. 

Remark 9. If denotes an orthonormal basis of 3" (finite or infinite), 
then the kernel can be written as 

i^(x,y) = 5^0,,(x)0:M- 

Let nx{f) be the number of isolated zeros of / that belong to a 
measurable set %. The following result is well-known. It appears 
in IW, Prop. 3] and [13, Prop-Def.l.6A]. It is a consequence of Crofton's 
formula, also knwon as Rice formula or coarea formula. 

Theorem 10 (Root density). Let % be a locally measurable set of an 
n- dimensional manifold M. Let 3^i, . . . ,3^n be fewspaces. Let oji, . . . ,ujn 
be the induced symplectic forms on M. Assume that f = /i, ...,/„ zs 
a zero average, unit variance variable in 3^ = x ■ ■ ■ x 3^^- Then, 
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As the formulation in terms of reproducing kernel spaces is not stan- 
dard, we sketch the proof below (more details are available in [2^ 
Th.5.11]). 

Proof. First of all, let V = {(f , x) G 9^ x X : f (x) = 0} be the incidence 
variety, and tti : V — t- 5", : V — t- X be the canonical projections. 

In a neighborhood of each regular point (fg, Xq) of V, it is possible to 
parametrize V by an implicit function (G'(f),f) with G'(fo) = Xq and 

DG(xo) = -/^f (xo)-i (iri(xo, •)* ® ■ ■ ■ 8) i^n(xo, ■)*) 

where Ki is the reproducing kernel of 3^i. 

Let J'x denote the product _ft'i(x, ■)-'- x ■ ■ ■ x -ft'n(x, ■)-'-. 

The coarea formula is now 

nnxii)) = 7ir4^ /fco7rr^({)e-llfllV2dyHf) 

dVMi^) [ Arj(f,x)-2e-l'f|''/2drjjf) 



i2n 



idim3" 



with A^J = det (DG(x)DG(x)*)^/^ = | det (x)|-i n(^i(x, x))i/2. 
The reader may check that 



det|Df(x)|W= /\ ^ A/^(x) A/.(x) xIldxjAdxk 
i=i" j,k=i ^ ^ 

(otherwise, this is Lemma 4.3 in [22] )■ At this point, 
with 



The result below implies that fli = 2u!i, concluding the proof of the 
density theorem. □ 

Proposition 11. Let (u, w)j^x = i^j,x(u, Jw) be the (possibly degener- 
ate) Hermitian product associated to uji. Then, 



_1 f (D/.(x)u)D/.(x)w e-ll/^IP 
Proof of Proposition {TJi Let 



Ki{^, x) 
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be the orthogonal projection. Since the inner product (■, ■)i is the pull- 
back of Fubini-Study by x i— t- irj(x, ■), we can write the left-hand-side 
as: 

(P,DA^,(x,>,P,Dir,(x,-)w) 



(u, w)i 



Ki{:>c, x) 



For the right-hand-side, note that 

D/,(x)u =(/,(•), I^K,(-,x)u) = {f,{-),P^DK,{;^)u). 

Let U = p^P,Dir,(.,x)u and W = ^^^P,Dir(., x)w. Both 
U and W belong to The right-hand-side is 



If (Z}/.(x)u)D/.(x)w e-IIMI (f^-\( If in^TwV '~ 

2^(x,.)- ll^.(x,x)P (2^)din.:..-i d^^.(x,-)-U.J - 2^(.,.)x^^^'''^^^'''^^27r)'i-^> 

= i(U,W) / f |.|VN^/M. 

2 27r 



= (U,W) 

which is equal to the left-hand-side. □ 

4. Product spaces 

Let E and F be complex inner product spaces. If e G E and / G 
F, we denote by e ® / the class of equivalence of pairs (e, /) under 
(Ae, /) ~ (e, A/). The tensor product of E and F is the space of all 
linear combinations of elements of the form e ® /. In the case E and F 
are finite dimensional, E ® F can be assimilated to the space of bilinear 
maps E X F — )■ C. 

The canonical inner product for the tensor product of two spaces is 
given by 

(ei ® /i, 62 ® /2)e®f = (ei, e2)E /2)f • 
Now, let £ and J be fewnomial spaces on some complex manifold 
M. Then, £ ® 3^ is a fewnomial space on the product M x M, where we 
interpret (e ® f){xi,X2) = e{xi)f{x2). A classical fact on reproducing 
kernel spaces allows to recover the kernel of the tensor product: 

Theorem 12 (Aronszajn). The direct (=tensor) product posseses 
the reproducing kernel 

Ke^s^ ((xi, X2), (yi, 2/2)) = Ke (xi, yi) {x2, 1/2) 



This is [21 Theorem I p. 361]. Theorem II ibid gives us a convenient 
notion of 'product' for reproducing kernel spaces with same domain: 
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Theorem 13 (Aronszajn). The kernel K^^x,!/) = K^{x,y)Ky{x,y) is 
the reproducing kernel of the class 9 of restrictions of all functions of 

the direct (=tensor) product E ® F to the diagonal set Mi = {{x,x) : 
X G M} ~ M. For any such restriction, \\g\\ = min ||5f'||£;ig)_p, the 
restriction of which to the diagonal set Mi is g. 

If £ and 5" are spaces of fewnomials on M, we denote by £5" the class 
S described above. As an inner product space, S is just the orthogonal 
complement of the kernel of the restriction operator 

A: £(g)3^ ^ 0(M), 

9' ^ 9^9\M' 

The inner product of 9 is by definition the inner product of £ (8) 9^ 
restricted to (ker A)-*-. 

Given orthormal bases {eajaeA and {fb)beB of £ and 3^, we can pro- 
duce an orthonormal basis of 9 as follows. 

First, we notice that (cq (8) fb){a,b)eAxB is an orthonormal basis of 
£0 3^. 

Let ~ be the equivalence relation oi A x B given by 

(a, b) ~ (a', b') if and only if ea{x)fb{x) = ±ea'{x)fb'{x). 

Let C = For each equivalence class c e C (as a subset of 

Ax B), choose a pair (a, 6) e c and set 

9c = ^/¥c eafb- 
Lemma 14. {gc)cec is an orthonormal basis ofS- 
Proof. Define 



Gc = -^== CTabCa <8) fb 



fSc 

where aab = ±1 with sign choosed so that 

\/#C aabCafb = gc- 

The (Gc)cec are linearly independent, hence {gc)cec is a linearly 
independent set. 

In order to show that the span of {gc)cec is ker A-*-, let 

V //„6e„®/b e kerA. 



{a,b)eAxB 



This implies that 



ceC V {a,b)ec 
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and hence for all c, 



Reciprocally, let G G ker A-*-. Write 

G= ea ® /ft- 

{a,b)eAxB 

By hypothesis, G ± (Jab^a ® /b — Ca'b'Ca' ® /ft' whenever (a, 6) ~ (a', 6'). 
Therefore, G is a linear combination of the Gc- 
Finally, it is easy to check that 



{Gc, Gc') 



when c 7^ c' 

1 when c = c'. 



□ 



Lemma 15. Let M be fixed. The product of fewspaces of M is associa- 
tive and comutative. If one introduces the 'constant' fewspace 3 = {1}, 
then fewspaces on M are a semigroup. 

Example 16. Let M = C" and let CPi be the space of affine func- 
tions in n variables. To make it an inner product space, we assume 
that (l, orthonormal basis. We define inductively 

"Pd+i = Vd"?!- Here is the orthonormal basis of Td'- 




ao,...,a„>0 



Above, the multinomial coefficient 

d \ d\ 



Oq, ai, . . . , a„y oq! ■ ■ ■ ctn' 

is the number of ways to distribute d = + ■ ■ ■ + an balls into n + 
1 numbered buckets of size ao, . . . , a„. It is also the coefficient of 
x\^X2^ ■ ■ ■ x"" in (1 + xi + ■ — h Xn)'^- This corresponds to the unitarily 
invariant inner product defined by Weyl [33j, also known as Bombieri's. 
The reproducing kernel of ^d is easily seen to be 

Kd{y^, y) = (1 + xivi H h XnVnY. 

With the same formalism, we can also retrieve the multi-unitarily 
invariant inner product for the space of roots of multihomogeneous 
polynomial systems introduced by Rojas [32] . 



ON THE EXPECTED NUMBER OF ZEROS OF NONLINEAR EQUATIONS 11 

Example 17. Let A C (Z)" be finite, and M = (C^o)"- Let 3^^ be 
the space of Laurent polynomials of the form 



Assume arbitrary weights Ca > so that ||a;a||i = Ca- Then, 

irA(x,y) = 5^Cax"r- 

The A-th power 3"^ of 3" a is precisely J'b with B = \ Conv(yl) n Z" 
and weights 

Cb = ^ ^ Caj ' ' ' ■ 

ai-\ ha;i=b 

An orthonormal basis is (c^^^^x^). 

5. Proof of the main results 

Proof of Lemma [7} Let £ and be fewspaces on a complex manifold 
M, and let S = £^9^. By Theorem [El 

irg(x,y) = K£(x,y)^ify(x,y). 

By (|2]), we deduce that 

Now, we just insert the formula above in Theorem [TOl □ 

Proof of Theoreml^ Let S = 3^^3^^=^ ■ • • 3"^- Let Ui be the Kahler 
form associated to the space 3^i. The form associated to S is 

= AiWj, H h A„W5„ 

Because the Ui are 2- forms, they commute. Theorem [TO] implies that 

= 4^ / WgA---Awg 

^ AiiAi2---Ai„— / Ui, 

h,...,i„e{l,...,n} "^^^ 

= -^n-^i2 ■ ■ ■-^^n%le^■,,,...,/ne:F.„(^^M(f))■ 

^l,...,^„e{l,.••,n} 

In the last line above, the monomial A1A2 ■ ■ ■ A„ appears n\ times. Its 
coefficient is therefore 



A ■ ■ • A w,- 



n 



! %e5,,,...,/„e5,„(nM(f)). 
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□ 

6. Explicit calculation of the number of zeros 

6.1. The example in the introduction. We start by the bound on 
the expected number of roots of in the introduction. Let £ denote 
the fewspace of functions on the disk D = {zgC: \z\ <1} spanned by 
1 and e^. We assume that 1 and form an orthonormal basis. Then 

An easy computation is now 

_ g2Re(2) 

uJe = dd log Ke.{z,z) 



The following numerical approximation was obtained by Steven Finch 
using Mathematica. It was independently checked by this author using 
long double IEEE arithmetic. 

E/e£(n/(T))) = tt^M u = 0.202, 918, 921, 282 • • ■ . 
Jv 

It is obvious that Ef^zy^ijiD^f)) = J-n^Td ~ ^f^- Hence, 

E/6£y,(r2/(D)) = TT^W u + Uj>, = d/2 + 0.202, 918, 921, 282 ■ • • . 

Jv 

6.2. An n-dimensional example. We consider now systems where 
each equation is of the form 

Ef ™ai™a2 ™ci„ bixiH \-b„x„ 
Ja,h-^l ^2 ■ ■ ■ X„ e 

and the sum is taken for all < < ci and bi = 0, 1. The corresponding 
domain will be the polydisc D". 
The fewnomial space is 

Let CO = g{z) ^^^dz; Adz be the Kahler form corresponding to (£J'd). 
Then from Th|T2]and ([2]), we deduce that 



i=l 

Hence, 



/HI 



'^S = ^^9{zi) -^dzi A dzi. 



%,...,/.es(r^f(2)")) = TT- / < 

= n!(rf/2 + 0.202, 918, 921, 282 
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6.3. An unmixed example. We consider now the case wliere tlie first 
equation belongs to 9 = (fiJ'fii)'^" as above, but the other equations 
polynomials of degree ^2, ■ ■ ■ ,dn in each variable (they belong to CP^*^. 

Then, let :K = S^'T^l ' ' ' • ^^^^ ^^^^ 

rj_£ gAirpAidiH hA„(i„ 

From the previous example, 

— + AiO.202, 918, 921, 282 

2 

The coefficient in A1A2 ■ ■ ■ A„ is 

d^d2 — dn i! ^^-- -^" 0.202, 918, 921, 282 ■• • 

2*1 2"^ 

By Theorem [S] 

%ieg,/2eJ'2,-,/neT„^^D(f) = 

d\d2 ■ ■ ■ dn I d2 ' ' ' d 



n\- 



2" 



+ n- 11— r^O.202, 918, 921, 282 
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